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Abstract 

We analyze the presumptions which lead to instabilities in theories 
of order higher than second. That type of fourth order gravity which 
leads to an inflationary (quasi de Sitter) period of cosmic evolution by 
inclusion of one curvature squared term (i.e. the Starobinsky model) 
is used as an example. The corresponding Hamiltonian formulation 
(which is necessary for deducing the Wheeler de Witt equation) is 
found both in the Ostrogradski approach and in another form. As an 
example, a closed form solution of the Wheeler de Witt equation for 
a spatially flat Friedmann model and L = R 2 is found. The method 
proposed by Simon to bring fourth order gravity to second order can 
be (if suitably generalized) applied to bring sixth order gravity to 
second order. 



PACS numbers: 0320 Classical mechanics of discrete systems: general 
mathematical aspects; 9880 Cosmology; 0450 Other theories of gravitation 
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1 Introduction 



It is quite general belief that curvature squared terms, if added 

to the Einstein - Hilbert action, describe semiclassically quantum correc- 
tions to General Relativity. Further, there is no doubt that the existence 
of an inflationary period (exponential expansion of the cosmic scale factor) 
solves a lot of problems connected with the standard big bang model of the 
universe. So there is no wonder, that the Starobinsky model - curvature 
squared terms lead automatically to the desired inflationary period - enjoyed 
so much interest in recent years. Now, Simon and others formulated some 
reasons speaking against the Starobinsky model, the main reason is the fact 
that the field equation underlying the Starobinsky model is of fourth order. 

It is the aim of the present paper to analyze those arguments, which 
are connected with higher ( = higher than second) derivative theories. The 
80-pages article [1] entitled "The problem of nonlocality in string theory" 
discusses in its set. 2 the "fundamental problems of nonlocality through the 
higher derivative limiting procedure". The principal result of its subsection 
2.1. is that at least iV — 1 of the solutions of a nondegenerate theory of 
order 2N carry negative energy. Eliezer and Woodard write: "The energy 
is therefore unbounded below for all nondegenerate, higher derivative theo- 
ries". This leads to the instability observed in almost all fourth and higher 
order theories. The Starobinsky cosmological model follows from fourth order 
gravity, and so it seems to be 

a candidate for such an unstable theory, see e.g. ref. [2] which is entitled 
"No Starobinsky inflation from self-consistent semiclassical gravity". We 
analyze that part of the arguments which is connected with the higher order. 
To this end we specialize the Ostrogradski approach [3] (which is a method 
to bring a higher order Lagrangian into Hamiltonian form - more recent work 
on this topic can be found in ref. [4]) to fourth order theories in set. 2 and 
give some intuitive examples. In set. 3 we discuss the question, whether 
fourth order theories lead to a minimum or only to a saddle point of the 
action. In set. 4, a method different from Ostrogradski's one is proposed to 
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bring fourth order equations in a Hamiltonian form. 

Then we are prepared to consider the Starobinsky model [5] in set. 5. 
The main problem comes from the i? 2 -term, so we simplify in set. 5.1. by 
discussing the high-curvature limit and derive the corresponding Wheeler de 
Witt equation by the method described in set. 4. Set. 5.2. discusses the 
question of the superfluous degrees of freedom of fourth order gravity and 
set. 5.3. gives the Starobinsky model in form of a power series not yet found 
in the literature. Set. 6 is on sixth and higher order gravity. It is included 
to show which kind of problems additionally appear, if L = R + co-R 2 -gravity 
is intended to be the k = O-truncation of a power series 



We look for the Newtonian limit of that theory and generalize Simon's ap- 
proach [2] to this Lagrangian (1.1) truncated at k — 1. 
Set. 7 discusses the results. 



2 Ostrogradski's method for a fourth order 
system 

We follow Ostrogradski [3] but use the notation published in ref. [1] which is 
more familiar to the present reader, and we specialize always to fourth order 
theories which follow from a nondegenerate Lagrangian of second order. So 
we consider a 1-dimensional point particle with position q(t) at time t. A 
dot denotes j- and the Lagrangian is of the type 



where q G /, / ^ being a connected open subset of the space R of all reals, 
and q, q are allowed to cover all the reals. The momentum P 2 is defined by 



k 



L = R + Y.CiR^R 



(1.1) 



L = L(q, q, q) 



(2.1) 




(2.2) 
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In [1], the Lagrangian L is denned to be nondegenerate if this eq. (2.2) 
can be solved for q which takes place, loosely speaking, iff ^ 7^ 0. To 
avoid discussions for the case that (2.2) can be solved, but not uniquely, we 
additionally require that 

f = 

Under this circumstance the Lagrangian is nondenegerate if and only if F 
does not have any zeroes, i.e., it is a map F : I x R — > R\{0}. This we 
shall presume in the following. Then the Lagrangian (2.1) can be written as 

L = \(q) 2 F{q, q) + q G(q, q) + K(q, q) (2.3) 

To avoid discussions of differentiability, we simply require the three functions 
F 7^ 0, G, K to be real analytic ones. Then eq. (2.2) becomes 

P 2 = qF(q,q) + G(q,q) (2.4) 

and it can be uniquely inverted to 

a=\ f G ^ (2.5) 

The Euler - Lagrange equation following from eq. (2.1) reads 

5L _dL d dL d 2 dL _ 
5q dq dt dq dt 2 dq 

Subsequently we write = q and g( n+1 ) = q( n \ Inserting eq. (2.3) into eq. 
(2.6) we get an equation of the structure 

= q^F{q^\ q^) + J{q^\ q& , q^ , q^) (2.7) 

where J is a real analytic function composed of F, G, and K. From eq. (2.7) 
the notion of nondegeneracy becomes apparent: The second order Lagrangian 
(2.1) is nondegenerate iff the Euler - Lagrange equation is a regular fourth 
order equation. The fact that we restricted the domain of q to the subset 
/ of R is no real restriction because by a real analytic redefinition q(q) we 
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could get I = R. To get a Hamiltonian for this system, one needs a first 
order formulation with two coordinates. We define them as 

Q 1 = Q, Q 2 = q (2.8) 
The two conjugate momenta are 

dL d dL 
1 dq dt dq 

and P 2 defined by eqs.(2.2)/(2.4). The Hamiltonian H = H(Q\Q 2 , P ± , P 2 ) 
is obtained via 

2 

H = -L + J2 PnQ n (2.10) 

n=l 

With these definitions the canonical equations 

<T = — P=-™- (211) 
W 8PJ n dQ" [ } 

take place, and their validity implies the validity of the Euler - Lagrange 
equation (2.6). Now the arguments of F, G, K are (Q 1 , Q 2 ) and we get 

Q 1 = Q 2 , Q 2 = (P 2 - G)/F (2.12) 

and after some calculus 

H = P,Q 2 + ^(P 2 - G) 2 - K (2.13) 

where dH/dt = follows from eq. (2.6). So, H can be called the energy 
of the system. The essential point is that the energy is unbounded both 
below and above. This is directly seen from eq. (2.13) because if is a linear 
function in Pi. Remark: In ref. [1], it was argued that the problem lies in the 

fact that energy is unbounded below. More exactly one should say: unbounded 
below and above; supposed, energy is unbounded below and bounded above, then 
we simply change the signs of both L and H and get the energy bounded below. 
This is possible because no sign of H is preferred a priori - in contrast to classical 
mechanics where the sign of H is defined by the condition that kinetic energy is 
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non-negative. H = const, represents a first integral of eq. (2.6). It is, as it 



must be the case, a third order equation for q(t), and it has the structure 

H = —q^q^F + lower order terms (2-14) 

There is a singular point at q — 0. The definition eq. (2.13) of H is essentially 
(i.e. up to invertible linear transformations of L and H which do not change 
the dynamics) unique, because time-independent canonical transformations 
do not change it. That H is unbounded both below and above can be seen 
from eq. (2.14): Fixing the initial values q, q ^ 0, q, one can freely choose 
q^ (cf. eq. (2.7)) and gets H as unbounded. The instability following 
from H being unbounded below and above can be described as follows. A: 
In the particle picture one gets particles with positive and particles with 
negative energy. Then the unlimited production of pairs of such particles 
is not prevented by energy conservation. B: In the four-parameter set of 
solutions of the Euler - Lagrange equation one gets a subset of dimension 
> 1 of solutions with negative energy. Let us make this last point more 
explicit. To this end we first consider what happens if we add such a total 
derivative to the Lagrangian that the functional dependence does not change. 
This is done by 

L = L + ^-[M - Et] (2.15) 

where E is a constant and M depends on q and q. One gets 

dM ~ dM , 

so that the condition of invertibility of P 2 does not change, and 

F = F,H = H + E (2.17) 

So this transformation, too, does not change the properties discussed. Let 
us continue with the discussion of negative energy solutions. We assume 
that q = is a solution, and we fix E such that q = is a zero-energy 
solution. Remark: In the first order Lagrangian L = ^q 2 — ^q 2 one has p = q 
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and H = \p 2 + ^q 2 . For A = 1, one has the solutions q = suit and q = cost 
which both have energy H = i. For A = — 1, however, one has the solutions 
q = sinht and q = cosht which have energy H = \ and H = — \ resp. They sum 
up to the zero energy solution q = ex.pt. This is the instability meant. For the 

second order Lagrangian we consider only the terms up to second degree in 
the arguments. The terms q, q, qq, qq and qq + q 2 represent total derivatives, 
and we use them to bring the general form to 

L = \f + \q 2 ~ f <1 2 (2.18) 

The corresponding Hamiltonian becomes 

H=\q 2 + ^q 2 + ^q 2 -qq { * ) (2.19) 

The Euler - Lagrange equation reads 

= q W -Aq-Bq (2.20) 

The momenta are 

P l = Aq-Q®, p 2 = q 

. For A = B = one gets the positive energy solution q = t 2 and the negative 
energy solution q = t 3 + 1 . For B = 0, A = ±1, the solution 

q = at + (3s(t) + 7c(t) 

where s(t) = sinht for A = 1, s(t) = sint for A = — 1, analogously c(t), has 
the energy 

H = ±l(a 2 -P 2 ) + I r 2 
. For A = 0, B = 1, the general solution of eq. (2.19) reads 

q = a sin t + f3 cos t + 7 sinh t + 5 cosh t 

One gets 

H = a 2 + [3 2 - 7 2 + 5 2 
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The general case shows similarly that both signs of the energy appear. For 
non-linear equations, of course, the solutions do not simply add, but the 
behaviour of the signs of the energy is similar. 



3 Minimal, not only stationary action 

Instability may occur, if the action is not minimal, but only stationary. We 
shall check, whether this type of instability can occur in the second order 
Lagrangian discussed. More on this topic, especially applied to fourth order 
gravity, can be found in ref. [6]; however, the point here is only to convince 
the reader that requiring minimality of the action does not trivially rule out 
fourth order theories. To do so, we develop the action S[q + eh] defined by 

S\q] = [ T Ldt (3.1) 
Jo 

into powers of e, where L is the same as in eq. (2.1), and T > 0. Without 
loss of generality, the initial point of time was put t — 0. Let h be any 
differentiate function fulfilling h(0) = h(0) = h(T) = h(T) = 0. After 
partial integration and use of the notation eq. (2.6) we get 

r T SL f 2 

S[q + eh] = S[q] + e h— dt + -V[q,h] + 0(e 3 ) (3.2) 
Jo oq 2 

where 

T,r ,i r T ,2& L ; 2 d 2 L V2 d 2 L d 2 L n d 2 L ■ d 2 L , 
V[q, h] = / h 2 — + h 2 — - + h 2 — - + 2hh— — + 2hh— — + 2hh——.dt 

Jo oq 2 oq 2 oq 2 oqoq oqoq oqoq 

In dealing with V, partial integration does not help. So one should it dis- 
cuss directly. Remark: Before discussing the fourth order case, let us repeat the 

behaviour for the harmonic oscillator L = ^q 2 — \q 2 ■ One gets 

rT 

V= h 2 -h 2 dt 
Jo 
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One only needs the boundary conditions h(0) = h(T) = 0. From the first glance, 
V seems to possess only a saddle point, but a Fourier analysis with 

oo 

h = a n sm.{nirt/T) 

n=l 

gives 

T 00 -K 2 n 2 

^ = f E(^--« 

n=l 

For T < 7T, this is positive definite, for T = tt, it is positive semidefinite, and only 
for T > 7T it becomes a saddle point. The harmonic oscillator is the standard 
example of a stable model, so one should require the action to be locally minimal, 
i.e., minimal if considered over sufficiently short but finite time intervals. Let us 

now come to the Lagrangian eq. (2.18). Inserting it into eqs. (3.1), (3.2) we 

§ et T 

V = [ h 2 + Ah 2 - Bh 2 dt (3.3) 
Jo 

We perform the same Fourier analysis than in the previous example and get: 
The maximally allowed value T depends on A and B, but it is always positive, 
so that one has the same kind of stability here: If the time interval considered 
is sufficiently short, then each stationary point of the action represents a 
minimum. 



4 Another Hamiltonian formalism 

Besides Ostrogradski's approach discussed in set. 2, there exists another 
possibility to get a Hamiltonian from a higher order Lagrangian. It has the 
advantage that the relation from classical mechanics 

<"> 

remains valid, whereas Ostrogradski changed it to eq. (2.9). 
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Further possibilities to get a Hamiltonian are discussed in ref. [7] (see 
also the references cited there), the difference is that in [7], there is always 
a constraint, whereas we look for a method where no additional constraint 
must be introduced. 

Again, we start from eq. (2.1) and concentrate on Lagrangians of the 
type (2.3). The difference is now that the new coordinates are chosen to be 

q 1 = q, q 2 = q (4.2) 

So the dependence of L^q 1 , q 1 , q 2 , q 2 ) is unique, whereas by use of eq. (2.8), 
there is an ambiguity between Q l and Q 2 . To show up the procedure we find 
it more appropriate to concentrate on one single Lagrangian; the general 
procedure might become clear from it. Moreover, it is just that Lagrangian 
which will appear in set. 5. So we take 

L = (q) 2 e 3q (4.3) 

To prevent an identical vanishing of p 2 according to eqs. (4.1), (4.2), (4.3), 
we add a suitable total derivative to the Lagrangian (4.3). Let us first take 

(4.4) 

i.e., an equation to be used in set. 5.1. 

l = [(g) 2 + mf + m 4 ]e 3q 

In a second step we take 

L = L-2j t {qqe^) 

i.e., 

L = -[(g) 2 + 2qq^ + 6{q) 2 q}e 3q (4.7) 
We insert eq. (4.2) into eq. (4.7) and get 

L = -\(q 2 ) 2 + 2q l q 2 + 6(q l ) 2 q 2 } exp(3g 1 ) (4.8) 
10 



(4.5) 



(4.6) 



where we use q n = -j t (q n )- Applying (4.1) we get 

Pi = -[2<f + 12gV] exp(3g 1 ), p 2 = -2q x exp(3g 1 ) (4.9) 
The Jacobian is 

g^T^j =~4exp(3g ) (4.10) 
This differs from zero, and so we can invert eq. (4.9) to 

q 1 = -^expiSq 1 ), q 2 = [3p 2 q 2 - -pi] exp(-3g x ) (4.11) 

We get from eqs. (4.8)/(2.10) with the help of eqs. (4.11) the following 
Hamiltonian 

H = -\{pm - V 2 q 2 ) exp(-3g 1 ) + (q 2 ) 2 exp(3g 1 ) (4.12) 

It is essential to observe that the equation q 1 = q 2 follows from the canonical 
equations of H (4.12) without imposing it as additional constraint. So the 
equations (4.2) become automatically compatible. 

To give some feeling for Hamiltonians with negative kinetic energy we 
give six typical examples - hoping that this gives better insight than general 
formulations do. Example 1 Let k be a parameter fulfilling < k < 1. We 
consider the Lagrangian 

L = (qf - 2(q) 2 + kq 2 (4.13) 
for a 1-dimensional point particle q(t). The Euler - Lagrange equation reads 

= g (4) + 2q + kq (4.14) 

We insert the ansatz 

q = e xt , = A 4 + 2A 2 + k (4.15) 
into eq. (4.14) which leads to 

\ = ±i \/l±Vl-k (4.16) 
11 



representing four different purely imaginary numbers. Therefore, the general 
solution of eq. (4.14) can be written as 

2 , 

?(*) = £ c„sin(t n + tVl + (-l)Vl-A;) (4.17) 

n=l 

where ci, C2, h, ti are the four integration constants. Each solution is boun- 
ded in time. In the limiting case k — 0, the unbounded function q(t) = t 
is a solution. In the other limiting case k — 1, q(t) — tsint is also an 
unbounded solution. Example 2 Let e be a parameter fulfilling < e < 1. 

Let 

H = \p 2 + \q 2 + \P 2 + \q 2 + eqQ (4.18) 

be a Hamiltonian for two 1-dimensional point particles q, Q (or, equivalently, 
one 2-dimensional particle with coordinates (q, Q) ); p is the momentum 
corresponding to q, P to Q. Because of the restriction put on e, H is positive 
definite in all its arguments. For e = 0, this is nothing but two independent 
harmonic oscillators of frequency 1. Only the term ~ e introduces some 
interaction. The canonical equations following from eq. (4.18) are 

9H . dH „ . . 

-K- = q = p, -K- = -p = q + tQ = -q 4.19 

op oq 

flTT PjTT 

dP=® = P ' dQ = - P = Q + eq = ~Q (420) 

To integrate the system it proves useful to eliminate Q by use of eq. (4.19) 
as follows 

Q = -\{q + q) (4-21) 

This leads with eq. (4.20) to 

= g (4) + 2q + (1 - e 2 )q (4.22) 

With k = 1 — e 2 we meet exactly the system (4.14) from example 1. The 
result of example 1 is in agreement with the KAM- theorem which applies 
to the system considered here and states that there exists an interval of 
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positive e-values, such that the corresponding system is solved by torus-like 
(i.e., periodic) solutions. Each arbitrarily small value e 

gives rise to a bifurcation of the frequency according to |A| in example 1. 
Each solution is periodic and, hence, bounded. In the limiting case e — > 
corresponding to k — ► 1, the equivalence of example 2 to example 1 breaks, 
because for e = 0, all solutions remain bounded here. That this equivalence 
breaks as e = 0, becomes also plausible from the relation (4.21) between Q 
and q. H can be considered to be the energy of the system. Let us express 
it as function of q and its derivatives alone: (Calculations have been done by 
REDUCE 3.41) 

2He 2 = q[q + 2q}{\ - e 2 ) + [g( 3 )] 2 + 2qq^ + (qf + (g) 2 (l + e 2 ) (4.23) 

A direct calculation leads to 

e 2 ^ = Q[q^+2q+(l-e 2 )q] (4.24) 

hence, H = const, follows from the g-equation (4.22), but H = const, implies 
a solution for Q ^ only. Let us mention that the fourth order equation 
of example 1 is equivalent to the positive definite Hamiltonian of example 
2. Example 3 Let us start again from the system of example 1, but now 

we apply the Ostrogradski approach to make a Hamiltonian from it. New 
coordinates are Q 1 = q, Q 2 = q, see eq. (2.8), new momenta are 

see eqs. (2.9), (2.2). The Hamiltonian is 

H = -L + j2 P n Q n = P^Q 2 + -P 2 + 2(Q 2 ) 2 - k{Q v ) 2 (4.26) 

71=1 4 

Now, let us forget about the origin of H and calculate the canonical equations. 
Inserting Q 1 = q, we get after some calculus Q 2 — q, P\ — — 4g — 2q < - 3 \ P 2 = 
2q and finally 

= g (4) + 2q + kq (4.27) 
13 



i.e., as it should be, just eq. (4.14) of example 1. We insert the values for 
P n , Q n onto H and get 

H = (q) 2 - 2gg {3) - 2(g) 2 - kq 2 (4.28) 

This is also a conserved quantity: 

^ = -2q [ q W + 2q + kq] (4.29) 

but H is not evidently bounded. Clearly, each solution remains bounded, 
but the set of solutions for a fixed value H need not to be bounded: let us 
insert the solution (4.17) of example 1 with t\ — t 2 — 0, then one gets 

H = 2e[c 2 2 {\ + e) - c\{l - e)} (4.30) 

H can take each real value, and with arbitrarily fixed value H, we can find 
an unbounded set of functions solving for just this H. Let us compare this 
result with the analogous calculations in example 2: There one gets from the 
same initial conditions 

# = c 2 (l + e) + c 2 (l-e) (4.31) 

Up to the inessential prefactor 2e, it is just the other sign in front of c\ 
which makes the difference. Here, H > 0, and the set of solutions for a 
fixed value H forms a compact set of bounded functions, moreover: it is a 
uniformly bounded set of functions. So the conserved quantity H should not 
be considered as the energy of the system, because H better meets the point. 
This is another argument against using the Ostrogradski approach. Let us 
further mention, that the Poisson bracket of these two conserved quantities 
H, H identically vanishes, so it does not give rise to a further conserved 
quantity. 

Example 4 We take now the same H as in example 3, but we interchange 
coordinates and momenta (Pi — > q, Q 1 — >■ —p, P 2 — > Q, Q 2 — ► — P). 

H = 2P 2 -kp 2 -qP+^Q 2 (4.32) 
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Here, the kinetic energy is indefinite, but the solutions are, of course, also only 
the periodic ones of example 1. Here, we see that two different Hamiltonians 
may describe the same system, one has indefinite, the other has definite 
kinetic energy. On the other hand, a regular Hamiltonian must have a non- 
vanishing Jacobian 

d(q,Q) _d 2 H d 2 H d 2 H) 2 

d(p, P) dp 2 dP 2 { dpdP ) 1 ' ' 

Here, J < 0. (This J is the inverse to the Jacobian used in eq. (4.10); 
this does not matter since only the sign of J is essential here.) A one- 
parameter family of regular canonical transformations connected with the 
identity transformation cannot change the sign of J. Therefore, the Hamil- 
tonians H (example 2) and H (example 4) cannot be continuously deformed 
into each other by such a transformation, because in example 2, H is positive 
definite, and J > 0. Nevertheless, they describe the same system. Example 

5 Let us take the Hamiltonian 

H = Pp+ e -(q 2 + Q 2 ) + qQ, 0<e<l (4.34) 

Both the kinetic and the potential part are indefinite. The canonical equa- 
tions give 

P = q, p = Q, P = -q- e Q, p = -Q - eq (4.35) 

After some calculus we get 

= q (4) + 2q + (1 - e 2 )q (4.36) 

which is again the previously discussed system. Example 6 Now we start 

from a Lagrangian which differs from example 2 only in two changes of a 
sign. 

H* = \p 2 + \q 2 - \P 2 - \Q 2 + eqQ, 0<e<l (4.37) 

So, both kinetic and potential energy are indefinite. But as was seen in 
example 5, this does not exclude the equivalence. Let us first consider the 
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limiting case e = 0. Here, again, it is fully equivalent to the second example: 
There is no interaction between the two oscillators, and there is no a priori 
sign preferred for the energy. H and H* are two conserved quantities, whose 
Poisson bracket vanishes. The situation drastically changes if we come back 
to e > 0. One of the presumptions of the KAM-theorem is no more valid, and 
so we expect qualitatively different solutions for arbitrarily small values e. In 
the particle picture one can imagine the following: The spontaneous creation 
of pairs of particles, one with positive, the other with negative energy, is 
energetically allowed, and it should take place with a typical doubling time ~ 
1/e. For fixed energy, arbitrarily large momenta are possible. We perform the 
calculations analogous to the previous ones. We can prevent any calculations 
if we look at H and H*: Multiplying P and Q by i and multiplying e by (— i), 
one is changed into the other. So, clearly, the other formulas are valid if e 2 
is replaced by (— e 2 ). Then the dynamics follows from 

= g {4) + 2q + kq, k = 1 + e 2 (4.38) 

and it is example 1 with k > 1. The corresponding fourth order polynomial 
for A is then solved by 

, e e 2 

\ = ±iVT±k = ±-±i{l + -)+0(e 3 ) (4.39) 
2 8 

The four solutions correspond to the four combinations of the signs "±". 
Therefore, the general solution can be written as 

2 2 

?(f) = E c n eMt(-m^ + 0(e 3 ))]sm[t n + t(l + e - + 0(e'))] (4.40) 

where ci, C2, t\, £2 are the four integration constants. (By the way, the 
^( e3 ) = lu + 0(e 2 )]e 3 for this formula.) q(t) = is the only bounded 
solution, and for C2 7^ 0, one gets an exponential increase as expected. This 
is, of course, a resonance effect. If, on the other side, H* is altered by a 
suitable positive factor in front of p 2 , then for small e, the general solution 
remains to be periodic, but the periods are mixed. 
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5 The Starobinsky model 



In ref. [5], Starobinsky proposed to use 

L = (f-£*V=5 (5.D 

as gravitational Lagrangian. Here, R is the curvature scalar, g the deter- 
minant of the metric of space-time, and / is a length being somehow in the 
region I = 10~ 28 cm. 



5.1 The high-curvature limit 

Let us first consider the high-curvature limit 

l = LtfJZT g (5.2) 
36 

For the metric of a spatially flat Friedmann model 

ds 2 = dt 2 - e 2q{t) {dx 2 + dy 2 + dz 2 ) (5.3) 

we get 

R = -Qq - 12(g) 2 , g = -e 6<? (5.4) 
and the Lagrangian becomes 

L= [q + 2(q) 2 ] 2 e 3<1 (5.5) 

which coincides with eq. (4.5). Now we could apply both the Ostrogradski 
approach set. 2 as well as the approach of set. 4. We prefer to use the latter 
one because of the validity of eq. (4.1), but for comparison we write down 
both of them. Let us first apply the Ostrogradski approach to eq. (5.5). 
Looking at eqs.(4.5), (4.4), (4.3) one can see that (up to a divergence) we 
have to consider 

L = (qfe 3q (5.6) 
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Applying the formalism of set. 2 we get 



Q=q, 



Q 2 = q, 



Pi = - 



2^(ge 3<? ), P 2 = 2qe^ 



and then 



iJ = P 1 Q 2 + i(P 2 ) 2 exp(-3Q 1 ) 



(5.7) 



i.e., 



H = e 3q [(qf-2q(q (3) + 3qq)} 



(5.8) 



As is to be expected, the canonical equations to the Hamiltonian H eq. (5.7) 
give again the original system, where H eq. (5.8) represents a conserved 
quantity. Moreover, one knows that the gravitational field equation (here 
especially its zero-zero-component) forces H to vanish. This can easiest be 
shown by making the ansatz dt 2 = N 2 (t)cIt 2 and putting N — 1 only after 
the variation (and not before as we did). Let us now come to the Wheeler de 
Witt equation (i.e., the zero energy Schrodinger equation of a cosmological 
model) for this system. In units where h — 1 it is obtained via substituting 
P n by id n = i-^K- Applying this to eq. (5.7) one can see that the fact that 
Pi is contained linearly and not quadratically gives as consequence that one 
of the coefficients 

of the Wheeler de Witt equation fails to be real. A third reason speaking 
against this approach is the fact, that Q 2 = q (which is just the Hubble 
parameter of the cosmological model eq. (5.3)) is not invariantly defined, it 
changes its sign by a change of the time direction. Now we try the same with 
the formulas of set. 4. In fact, we can work directly with eqs. (4.3), (4.6 
- 4.12). If we re-insert eqs. (4.2), (4.9) into eq. (4.12) we get exactly the 
eq. (5.8). So no second conserved quantity appears here, and the Wheeler 
de Witt equation is derived as follows. The material from eq. (5.9) till eq. 
(5.13) is taken from ref. [8], and from eq. (5.14) till eq. (5.15) is taken ref. 
[9]. The Lagrangian L eq. (4.8) has the structure 



(5.9) 
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where g^ depends on the q l only and the Einstein sum convention is applied. 
One gets 

dL 

Pl = W = 9i ^ 3 (5 ' 10) 

To apply the Hamiltonian formalism, it is necessary to invert eq. (5.10) such 
that the velocities are written in dependence of coordinates and momenta. 
This is possible if and only if g^ is an invertible matrix. This takes place for 
the case considered here: A comparison of (4.8) with (5.9) gives 

gn = -6g 2 exp(3g 1 ), g 12 = - exp(3g 1 ), #22 = 

Let g % i be the inverse matrix to gij. Then eq. (5.10) can be inverted to 

<)' .</''/', (5.11) 

In the interesting case (4.8) this gives 

g 22 = Qq 2 exp(-3g 1 ), g 12 = - exp(-3g 1 ), g 11 = 

The Hamiltonian becomes 

H = p % q % ~L = \g ij PiPj + V{q*) (5.12) 

and here, V = (q 2 ) 2 exp(3g 1 ). If we quantize now by substituting p n by i-^ 
then the procedure is no more covariant, and the factor ordering problem 
appears. In classical mechanics this problem is absent, because g^ is a con- 
stant matrix. We cirumvent the problem by substituting p n by iV„ where V n 
denotes the covariant derivative into (/"-direction with respect to the metric 
g^. Then the Wheeler de Witt equation reads 

= (□ - 2V)ijj(q i ) (5.13) 

where ip is the world function and 

□ = V,V l = -^diy/^g^dj 

is the D'Alembertian. For our example we get 

= [3d 2 q 2 d 2 - did 2 - 2{q 2 ) 2 exp(6g 1 )]^ (5.14) 
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To simplify, let us apply the following transformation 

3 

a = exp(-3g 1 ), r = -q 2 exp(3g 1 ) 

This transformation explicitly brings g^ to the flat form g ar = 1, g aa = 0, 
g TT = 0. The Lagrangian becomes 

L = —ar — r 2 cr 
a 

The Hamiltonian is correspondingly 

H = a[7T T 7r a + r 2 ] 
and the Wheeler de Witt equation reads 

0=[d T d a -r 2 ]^(a,r) . (5.15) 
This linear differential equation can be solved in closed form by 

/oo 
a(A)exp(A(x + — )dX (5.16) 
-oo oA 

where the amplitude function a, a(0) = 0, can be arbitrarily chosen both as 
continuous as well as a sum of (^-functions. 



5.2 Superfluous degrees of freedom 

We look at higher ( = higher than second) order gravity theories under the 
point of view that the higher order yields more degrees of freedom than is to 
be expected. Further results on higher order gravity and inflationary phase 
of cosmic evolution can be found in refs. [10 - 24] (a list which is not in- 
tended to be representative but essentially contains the papers we refer to 
in the subsequent text). Let us start with some historical comments taken 
from ref. [14]. In [25], Weyl proposes a new theory which is intended to 
unify gravitation with electromagnetism. He both generalizes the Rieman- 
nian geometry underlying the General Relativity theory of Einstein to a non- 
integrable theory (introduction of the Weyl vector) as well as the Einstein - 
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Hilbert Lagrangian (which is linear in curvature) to a Lagrangian quadratic 
in curvature. At that point Weyl already realized, [25, p. 477]: "Dies hat zur 
Folge, daB unsere Theorie wohl auf die Maxwellschen elektromagnetischen, 
nicht aber auf die Einsteinschen Gravitationsgleichungen fiihrt; an ihre Stelle 
treten Differentialgleichungen 4. Ordnung." (Translation: This has the con- 
sequence, though our theory leads to Maxwell's electromagnetic equations, 
it fails to lead to Einstein's gravitational ones; instead of them, 4th order 
differential equations appear.) We cite this sentence to show that already 
in 1918 the possibility of fourth order gravitational field equations had been 
discussed as alternative to General Relativity. One year later, Pauli [26] cal- 
culated the static spherically symmetric solutions of Weyl's theory; he got 
the result that the Schwarzschild solution is a solution for all the equations 
following from one of the three Lagrangians 

R 2 , RikR lk : RijkiR 1 ^ 1 (5-17) 

(R is the curvature scalar, R^ the Ricci tensor, Rijki the Riemann tensor.) 
Pauli assumed the Weyl vector to be zero, so that he had a Riemannian 
structure of Lorentz signature as underlying geometry. He concluded that 
measurements of Mercury perihelion advance and light deflection in the field 
of the Sun which are in agreement with General Relativity are also in agree- 
ment with all the variants of Weyl's theory, but the fourth order theory 
has too many ambiguities (both in finding the correct Lagrangian as well as 
choosing the correct solution); more explicitly this is done in [27]. Today one 
can say more generally: Each vacuum solution of Einstein's equation is also 
a vacuum solution of each of the variants of fourth order gravity (where Pauli 
[26] believed this to be the case for the first two expressions in eq. (5.17) 
only) . Pauli [26] wrote about the superfluous degrees of freedom, that they 
are a consequence of the fact that he only considered the vacuum equations 
and that it should be possible to cancel them by finding the correct interior 
solution at the source. The latter is only a mathematical problem; we pro- 
ceed on this line in set. 6.1. Further, he assumes that the far-field of a mass 
m can 
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be developed into powers of m/r, where r is the distance from the center 
of the source. The last point we want to repeat from the old papers is the 
following: R has dimension < length >~ 2 , and therefore, the action 

J R m ^d n x (5.18) 

(where g is the determinant of the metric in the n-dimensional space-time) 
is scale-invariant (i.e., does not change by a change of the used length-unit) 
if and only if n = 2m holds. For the usual case n — 4, this gives m = 2, an 
argument which was already used by Weyl in 1918. Now let us come back 
to Simon's argument [2] that the superfluous degrees of freedom have to be 
cancelled: Surely, he has found one possibility, but that one is a priori not 
better than the following ones. In units where 8irG = c = 1 we use the 
Lagrangian 

L = (-R+ -k 2 C 2 - —l 2 R 2 )^g- (5.19) 
v 2 4 12 ;v y K J 

where 

C 2 = C ijkl C^ kl (5.20) 

is the square of the Weyl tensor and can be written as linear combination of 
the terms in eq. (5.17). The Lagrangian (5.19) gives rise to a tachyonic-free 
theory if and only if both k 2 > and I 2 > hold. (In principle, k 2 and 
I 2 may have both signs, we prefer the nontachyonic case.) It holds (see ref. 
[28], and ref. [29] for the presentation used here): If we redefine the original 
metric g^ to Gy via 

dj = gij - 2k 2 Rij + ^{k 2 - l 2 )R 9lj (5.21) 

then the linearized Einstein tensor of G^ vanishes if and only if g^ solves the 
linearized fourth order 

equation following from eq. (5.19). For microscopically small lengthes k 
and / both metrics cannot be distinguished by experiment, and so one is free 
to use G^ as physical metric possessing the required second-order dynamics, 
at least on the linearized level. The second possibility is the following: For 
the spatially flat Friedmann model the typical solution of fourth order gravity 
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is (cf. e.g. refs. [19] and [20]): Damped oscillations around the expansion law 
of the Einstein de Sitter model. This is not only due to the high symmetry: In 
ref. [30] there is considered the general anisotropic Bianchi type I model, and 
the result was the same. The general behaviour with inhomogeneous models 
is not known, but there exist reasons (by the conformal transformation of 
fourth order gravity to Einstein's theory with a minimally coupled scalar 
field, see e.g. ref. [31], where flat space is related to a local minimum of the 
potential) to believe that there are similar typical solutions. We interpret 
them as follows: The superfluous degrees of freedom are just the phases of 
the oscillations, and by the damping of the amplitudes they simply disappear. 

5.3 Starobinsky inflation as power series 

In this section we consider in more details than can be found in the literature: 
In which sense the Starobinsky inflationary solution can be developed in a 
power series ? To this end we make the following consideration (which makes 
more explicit what has been done in ref. [20], set. 5, especially [20], eq. 
(18)). It is essential to note that there is neither a cosmo logical term nor an 
additional inflation or scalar field - all inflation comes from the R 2 -term in 
the Lagrangian (5.19). The exact inflationary de Sitter space-time is defined 
by eq. (5.3) with q(t) = ht, h having a constant positive value. In general, 
h = q is the Hubble parameter. The quasi de Sitter stage is that period, 
where \dh/dt\ <C h 2 . Now, the field equation following from Lagrangian 
(5.19) is considered. The Friedmann model is conformally flat, and so, the 
term with the Weyl tensor identically vanishes. So, without loss of generality 
we put k = 0. Further, we consider the non-tachyonic case I 2 > only. 
Suen [17, 18] showed that flat space is unstable and that the Starobinsky 
solution is stable. A partial stability of flat space with respect to initially 
expanding perturbations can be found, however: in [20] it was shown: All 
vacuum solutions representing an expanding spatially flat Friedmann model 
of the field equation following from the Lagrangian (5.19) with I 2 > can 
be integrated up to infinity; they all have the same asymptotic behaviour: 
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Damped oscillations with frequency l/l about the Einstein de Sitter model 
q = t 2 / 3 . The spatially flat Friedmann model has the metric (5.3). We take 
the Lagrangian (5.19) and get a fourth order differential equation for the 
metric. However, the 00-component of the differential equation is a constraint 
and gives a second order equation for h which reads 

Linearization of this equation gives = 0, so that it is clear that the linearized 
equation gives no information about the full one. It holds: Each solution of 
(5.22) is also a solution of the other 9 components of the field equation; 
however, each function h solves the linearized equation but in general not 
the linearized trace equation which reads simply 

§+'- 2 f- <w 

Next, it is clear that eq. (5.22) has a singular point at h — 0, and so the 
numerical integration has to be done with care. The best method to integrate 
the system numerically is the following: One uses the constraint only at the 
initial moment and then one integrates the trace equation; the trace is regular 
even for h = 0. Also the existence of oscillations with frequency l/l becomes 
clear from (5.23), and the sign of the r.h.s. of eq. (5.22) decides whether the 
oscillations are damped or not. But the result of [20], that for I 2 > and 
initial value h > the equation (5.22) can be integrated up to infinite time t, 
is strong and does not depend on the numerics, and it does not change if we 
include classical matter like dust or radiation. In eq. (5.22) the inflationary 
period can be found by requiring that the first two items are negligible in 
comparison with the third one. (Afterwards it will turn out that the first two 
terms remain finite whereas the third one tends to infinity as h — ► oo; so 
this approximation is consistent.) We get the first step of the approximation 
by removing the first two terms of eq. (5.22); this leads to the equation 

dh 1 

~dt ~ ~oi 



(5.24) 
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The larger the value h , the better the approximation (5.24). This justifies 
to use a Laurent sequence in h 2 as general ansatz as follows 

f = -^E(-i) i ft(«)" 2i ( 5 - 25 ) 

We included such powers of the length / as factors that the coefficients Qi 
become real numbers. Comparing (5.24) with (5.25) one gets go = 1/6. The 
motivation of the factor (—1)* will become clear afterwards: all numbers Qi 
will turn out to be positive. Just for the same reason we did not write odd 
powers of 1/h in (5.25), because all their coefficients automatically vanish 
if we insert the sequence into eq. (5.22). This is very satisfactory because 
even powers of 1/h correspond to powers of h whereas the odd powers would 
correspond to Vh~ , which is a less natural quantity. The coefficients ^ can 
be obtained as follows: h times the derivative of eq. (5.25) gives 

h M = ^£ i _ 1)i2igi{hl) _ x (5 26) 

Now we insert eqs. (5.25), (5.26) into (5.22), multiply by I 2 and get step by 
step 

dh 00 

^[6h 2 l 2 + £(-iy(4* + l) gi (hl)- 2i ] = ~h 2 (5.27) 
at i=0 

After division by (— h 2 ) and some rearrangement we get 

oo oo k— 1 

J2(-l) k 6g k (hiy 2k - J2(-l) k (hl)- 2k J2(^ + l)9i9k-i-i = 1 (5-28) 

fc=0 fe=l i=0 

The absolute value of eq. (5.28) gives again g = 1/6 , and for each k > 
we get 

0* = £ E( 4i + l )9i9k-i-i (5.29) 

" i=0 

e.g. g\ = 1/6 3 = 1/216, g<i = 1/6 4 , g% = 65/6 7 . The next natural step 
seems to be the insertion of (5.29) into the ansatz (5.25). But it turns out 
that one gets the result more quick by integrating that equation which is ob- 
tained from (5.27) after division by h 2 (time-translation is only a coordinate 
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transformation, so we get no essential constant of integration): 

00 47 + 1 

Qf h + Y,{-l) i+1 g i ---h- 1 {hl)- 2i = -t (5.30) 

i=0 Zl + L 

This equation can be inverted as follows: 

k = -a* - s +§><('/')" < M1 > 

with certain dimensionless constants . With eq. (5.31) we solve eq. (5.22) 
and insert the result into the metric (5.3). To simplify the expressions, we 
perform the coordinate transformation t — l- T. Then the metric describing 
the Starobinsky inflation reads 

00 

ds 2 = l 2 [dr 2 - exp(-r 2 /6)|r|- 1 / 3 ^g i r^(rfx 2 + dy 2 + dz 2 )} (5.32) 

i=0 

where q — 1 and the other are certain real constants. Metric (5.32) gives 
a good presentation in the region — 00 < t <C — 1. One can see that it 
would not have been found by a simple guess, e.g., by a Fourier or Laurent 
sequence in t or so. How to come from (5.29) to the analogous expressions 
for the fi and the ^ is straightforward analysis, and the convergence of the 
sequences can be proved; from the line after eq. (5.29) it becomes at least 
quite plausible. In the presentation (5.32) it is not immediately clear that this 
is inflation; for r <C —1, the parabola — r 2 /6 is an almost linearly increasing 
function, so that the cosmic scale factor is almost exponentially increasing, 
because the other terms do not essentially change the picture. Let us now 
come the main question here: what happens for I — > ? In metric (5.32), 
all metric coefficients can be developed into powers of I 2 , moreover, they are 
quadratic polynomials in I. However, for I — > 0, the metric degenerates. 
This is essentially the argument of Simon [2], that Starobinsky inflation is 
not selfconsistent in semiclassical gravity. One should look whether this 
effect depends on the special coordinates chosen. To this end we go back to 
synchronized coordinates t = rl. Then the factor exp(— t 2 /Ql 2 ) brings the 
problem (besides the third root of I in the next factor), whereas the further 
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sequence is a sequence in I 2 . This is in agreement with the fact, that for 
/ = 0, the corresponding field equation has only the flat Minkowski space- 
time as solution. The recent view of the Starobinsky model can be found e.g. 
in refs. [41], [42] and a more geometrically oriented review in [43]. 



6 Sixth and higher order equations 

In this section, we consider gravitational field equations of order higher that 
fourth; this is mainly done to show, how the fourth order Starobinsky model 
is situated between the Einstein theory and the sixth and higher order ones. 

6.1 The Newtonian limit 

The Newtonian limit is the slow-motion approximation of the linearized field 
equation. In this limit, the fourth order field equation following from (5.19) 
becomes tractable. For a 5-source of mass m one gets 

ds 2 = (1 - 2§)dt 2 - (1 + 26)(dr 2 + rW) (6.1) 

where dVl 2 denotes the metric of the unit S 2 , 

m 4 i 
$ = -[1 - - exp(-r/fc) + - exp(-r/0] (6.2) 
r 6 3 

see [32], and 

m 2 1 
= -[!-- eM-r/k) - - exp(-r/Z)] (6.3) 

see [15]. It is essential to observe that the solutions (6.2, 6.3) are unique. 
One should notice: Inspite of the higher order of the differential equation 
one needs the same restriction (namely, the vanishing of $ and 9 as r tends 
to infinity) to get a unique Newtonian limit. We have considered the same 
question for a class of gravitational field equations of arbitrary high order 
and got the same result [21] for the tachyonic-free case. We used 

i=0 0<j o <ji<...<ji<p m=0 
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where p > and < l < h < ■ ■ ■ < l p are characteristic lengthes and □ 
denotes the D'Alembertian. (For comparison: eq. (5.19) with k = and eq. 
(6.4) withp = coincide.) For eq. (6.4) the Newtonian limit gives (6.1) with 

* = -[l + \ E(-l) i+P nil" II" 1 exp(-rA)] (6.5) 

r 6 i=0 j+i l i 

and $ + 9 = — . For an extended mass distribution the result is the same 

r 

because of the linearity, and for the full nonlinear equations one can conjec- 
ture that at least in the vicinity of flat space the result remains the same. 
There is an essential point of departure from the Pauli-approach mentioned 
before and the calculations here: For L = R 2 (ref. [33]) and also for the 
other purely quadratic Lagrangians (i.e. linear combinations of the terms in 
eq. (5.17), see refs. [34, 35]) one does not get the correct Newtonian limit 
unless one adds the Einstein-Hilbert Lagrangian to the action. (Remark: 
In [16] and [36] the same problem is considered with the same Lagrangian 
but another variation (Palatini's one which gives the same theory for the 
Einstein-Hilbert Lagrangian only), i.e., independent variation with respect 
to metric and affinity; the result agrees not only with respect to the fact 
that the Einstein-Hilbert Lagrangian must be added, but also with respect 
to the general Newtonian plus Yukawa-type of the potential.) Let us end this 
section with a further point of departure from Pauli's approach [26]: He (and 
the authors of [37] and [38] too) required the outer solution to be developable 
in powers oim/r. But then only the Schwarzschild solution appears which 
is definitely not the outer solution for a point mass. And neither eq. (6.2) 
nor (6.3) can be developed in powers of 1/r. 

6.2 Generalization of Simon's approach to higher or- 
der gravity 

In the units chosen here (8irG = c = 1) the Planck length l Pt is related to 
Planck's constant via h = . So, Simon's expansion [2, 11] into powers 
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of h is equivalent to an expansion into powers of I 2 , where I is a fixed length. 
Let us take as example the Lagrangian (6.4) which was already considered 
in [22] and [39] for p — 1 and in [23] for general p. Eq. (6.4) can be written 

as 

L = j-^fcd 2l ^R (6.6) 
z iZ i=0 

with numerical constants Cj . We suppose c p ^ 0, and (6.4) leads to a field 
equation of order 2p + 4. (We do not need it in extent here; one can find it 
in [23], eq. (8).) For simplicity, we consider the vacuum equations only. We 
show that by the method of Simon, the order can be reduced to 2p + 2 as 
follows: We suppose (6.4) to be the truncation of an infinite sequence, and so 
it is valid only up to corrections of order 0(/ 2p+4 ). Then we multiply the field 
equation following from (6.4) by 1 2 p +2 with the result that only the term from 
the Einstein-Hilbert part of the Lagrangian survives; all other contributions 
can be subsumed to another 0(l 2p+A ). So we get 

= R t] l 2p+2 + 0(/ 2p+4 ) (6.7) 

For p = 0, this is just eq. (5.3) of ref. [11]. We can form the covariant 
derivatives of (6.7), multiply it by R and form traces. Then it is possible to 
add such a linear combination of these equations to the field equation that, 
up to terms of the order 0(l 2p+A ), all terms stemming from c p RO p R in eq. 
(6.6) are compensated and the field equation reduces to the order 2p + 2. 
For p = this coincides with Simon's approach. If the higher order terms 
in the Lagrangian do not contain derivatives of the curvature, then the field 
equation is of fourth order in each step; this has been analysed in [12]; there 
it is also mentioned that Starobinsky inflation remains a consistent solution if 
one interprets fourth order gravity as classical theory and not as semiclassical 
one. This point of view (see also [13]) is compatible with Stelle's result [40] 
that fourth order gravity, if taken as classical theory, becomes - in contrast 
to Einstein's theory - renormalizable. But here we have chosen an example 
where the order of the differential equation is increased step by step. The 
next question which is interesting for p > is whether the procedure can 
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be repeated such that the order can be reduced from 2p + 2 to even lower 
order, one should expect that it must be order 2 at the end. The simplest 
non-trivial example is p — 1, where, after the first step described above, the 
following fourth order equation appears 

R I 2 R 2 
= % - - 9ij - - [DR 9ij - R. :ij + R Rij - — 9ij ] + 0(f) (6.8) 

where the semicolon denotes the covariant derivative. The essential difference 
between eq. (6.8) and eq. (1.1) of [11] is now the power (here 0(Z 6 ), there 
0(/ 4 )) of the remainder. The trace of (6.8) reads 

= R + l 2 nR + 0{l 6 ) (6.9) 

We apply l 2 D to eq. (6.9) and get 

= l 2 DR + l 4 DDR + 0(l 6 ) (6.10) 

The same done with (6.10) yields 

= l 4 UUR + 0{f) (6.11) 

The sum of eqs. (6.9) and (6.11) minus eq. (6.10) yields 

= R + O{l 6 ) (6.12) 

Similarly one can handle the trace-free part of eq. (6.8). So we have shown 
that (at least this type of) sixth order gravity can be brought to second 
order by Simon's approach. But one should mention that we have, as Simon 
did, made such assumptions that the application of Z 2 D does not change the 
power of the general remainder. This is a consistent assumption because l 2 
is a dimensionless operator. By inclusion of matter, one gets then covariant 
derivatives up to the fourth one of the energy-momentum tensor (instead of 
second derivatives found in [11], eq. (5.4)). The corresponding calculation 
is straightforwardly done, so we do not write out the formulae. (Also, they 
are not so essential here, because in regions where the higher order terms 
are dominant, one usually believes that matter is not yet essential for the 
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dynamics.) Let us sketch them: The l.h.s. of eq. (6.7) becomes, in analogy 
to eq. (5.3) of [11], Kl 2p+2 (Tij — ^Tg^), where is the energy-momentum 
tensor and T its trace. The l.h.s. of eq. (6.8) gets the form n[Tij + Z 2 (DT+ 
similar terms)], and then two further covariant derivatives to the l.h.s. appear 
similar as those one in eqs. (5.4)/(5.5) of ref. [11]. 



7 Discussion 

The Starobinsky model goes back to early ideas of J. B. Zeldovich and A. D. 
Sakharov, see e.g. ref. [44], where the addition of higher curvature terms to 
the Einstein - Hilbert action was intended to mimic quantum gravitational 
effects; it was hoped that these terms can prevent the initial singularity. 
Another approach can be found in ref. [45], where the stress tensor renor- 
malization of quantized matter fields in a classical background metric lead 
to curvature squared terms in the effective action with spin-dependent cal- 
culable coefficients in front of them. A third approach was performed by 
Stelle [40], who showed that the Lagrangian (5.19) leads to a renormalizable 
theory of gravity; the coefficients in front of the curvature squared terms 
are not calculable, but should be measured. We distinguished these three 
approaches explicitly, because they are often mixed. The Starobinsky model 

follows from the Lagrangian eq. (5.1), which coincides with eq. (5.19) if 
k = 0. This is not a renormalizable theory of gravity, and it shares this 
property with Einstein's General Relativity Theory (GRT). One instability 
of the theory following from eq. (5.19) comes from the fact, that for k 2 < 0, 
tachyons appear and for k 2 > 0, ghosts appear (the latter are particles with 
negative kinetic energy). The Starobinsky model, however, contains neither 
tachyons nor ghosts. A further instability can appear if there is no mini- 
mum of the total energy of a given local system. In Einstein's GRT this 
is prevented by the well-known positive energy theorem, whereas eq. (5.19) 
with k 7^ allows an analogous theorem only in a very restricted sense, cf. 
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ref. [46]. For the theory following from eq. (5.1), however, a positive energy 
theorem, analogous to that one in GRT, is valid, see [47]. It needs as only 
additional presumption that R < 3/~ 2 . This represents no practical restric- 
tion because / is microscopically small and the inflationary period of cosmic 
evolution is connected with negative values of R. Connected with this fact 
is the point discussed in set. 3: requiring minimality of the action does not 
rule out fourth order gravity. A third instability could occur if one looks at 
eq. (5.1) as perturbation of Einstein's GRT, I playing the role of the small- 
ness parameter. This is, of course, a singular perturbation, and usually, one 
would expect quickly increasing solutions to appear. In general, this takes 
place, but under the special circumstances met here, this does not happen. 
This has its origin in the special kind of nonlinearity of the singular differen- 
tial equation (5.22): it has the property that for each initial condition with 
h > (i.e. initially, the universe expands), the system can be integrated up 
to infinite time, and there it tends to the corresponding solution of Einstein's 
GRT. (We made this explicit here in set. 5.3. because of statements found 
in refs. [17, 18] which seem to contradict this, but in fact, only use another 
notion of instability.) This regular behaviour of the solutions can also be 
seen in the Newtonian 

limit, see set. 6.1. If one looks at the solutions eqs. (6.2)/(6.3) one can 
see that they converge to the corresponding Newtonian potential as k, I — > 
but they cannot be developed into powers of k and I. So the problem of the 
superfluous degrees of freedom can be solved by stating that in the weak-field 
region, the coefficients of these terms are unobservably small. Another way 
to deal with the superfluous degrees of freedom is carried out by Simon in 
[2, 10]. Also in [1], page 408 there it is pointed out, that the Starobinsky 
model is not more unstable than Einstein's theory itself. These stability 
statements are all compatible. To see this, one has to remember that for 
initially contracting perturbations, both Einstein's theory and fourth order 
gravity yield a big bang- type instability after finite time. 

The instability appearing from the fact that a fourth order equation can 
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be brought to a Hamiltonian with indefinite kinetic energy, see refs. [1, 4], 
was analyzed in detail in set. 2. We showed at some typical examples that 
this can lead to instabilities, but it need not to do so. We proposed another 
general approach to bring a fourth order theory to Hamiltonian form in set. 
4. The advantages of our approach are also listed there. The Hamiltonian 
form of the theory is needed to deduce the Wheeler de Witt equation of the 
system. In set. 5.1. we made it for the high-curvature limit - and there the 
Wheeler de Witt equation could be solved in closed form, eq. (5.16). It is 
planned to make the analogous calculations for less symmetric space-times 
and the theory including the i?-term, i.e., for the Lagrangian (5.1). For the 
interpretation of them cf. e.g. ref. [48]. But for the problem discussed here 
one only needs the sign of the kinetic energy in the Hamiltonian formulation; 
it is the same as the signature of the superspace metric as is clear from eq. 
(5.12). In ref. [49], the following was shown: The signature S (=number of 
negative eigenvalues) of the superspace metric leading to the Wheeler de Witt 
equation following from Einstein's GRT equals S = l+s(n— s), where n is the 
dimension of the spatial part (usually = 3) of the space-time metric and s its 
signature (s = both for the Lorentzian as well as for the Euclidean signature 
of the underlying (n+l)-dimensional manifold). So S — 1 for Einstein's GRT 
and usual signature, which has the consequence that the Wheeler de Witt 
equation is a normal hyperbolic wave equation. What it essential here: The 
fact that for fourth order gravity the Hamiltonian formulation leads to an 
indefinite kinetic energy (superspace metric signature equals 1) is a property 
which it has in common with GRT. A discussion of the R + i? 2 -theory in 
connection with topological defects can be found in ref. [50]. Let us finally 
make some remarks what happens if one adds some higher order terms, e.g. 
those one discussed in set. 6, especially the Lagrangian (6.4) with p > 1 
leading to the order of the differential equation > 6. Then the problems 
become more serious. The superspace metric gets signature > 2, so the 
Wheeler de Witt equation is no more normally hyperbolic. The conformal 
transformation to Einstein's theory with several scalar fields (see [22] for 
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p — 1 and [23] for p > 1), leads always to ghosts. The hope that sixth order 
gravity naturally leads to models with double inflation is not fulfilled, see ref. 
[39]. It is unclear yet whether eighth order gravity (partial results can be 
found in ref. [51] - further work is in progress) can solve these problems. So 
the proposal by Simon [2] to reduce fourth order gravity seems practicable to 
be generalized (as we did in set. 6) to reduce sixth and higher order models 
to second order. 
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inflation as a power series", the calculations are correct, but the interpreta- 
tion of the solution as a non-singular one is wrong. To elucidate the origin 

of that error we give two lemmata. The coordinates t, x, y, z shall cover 
all the reals, and a{t) shall be an arbitrary strictly positive monotonously 
increasing smooth function defined for all real values t. ("smooth" denotes 
C°°-differentiable.) Then it holds 

Lemma 1: The Riemannian space defined by 

ds 2 = dt 2 + a 2 (t)(dx 2 + dy 2 + dz 2 ) 

is geodesically complete. This is well-known and easy to prove; however, on 

the other hand it holds Lemma 2: The Pseudoriemannian space defined by 

ds 2 = dt 2 - a 2 (t)(dx 2 + dy 2 + dz 2 ) (7.13) 
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is light-like geodesically complete iff 

,o 

/ a(t) dt = oo (7.14) 

("iff" denotes "if and only if".) The proof is straightforwardly done by 

considering light-like geodesies in the x— t-plane. Moreover, lemma 2 remains 
valid if we replace "light-like geodesically complete" by "light-like and time- 
like geodesically complete". (Lemma 2 seems to be unpublished up to now.) 

Let us now turn to the scope of this erratum: For the Lagrangian [1, eq.(5.1)] 

I R I 2 \ 

L — I R 2 ) J=g where / > 

\2 12 J v y 

one gets a fourth-order field equation; one of its solutions is described in [1, 
eq.(5.32)]. In the region t <C —I, that solution can be approximated by eq. 
(1) with 

a(t) = exp(-^) (7.15) 

However, this solution does not fulfil the condition eq. (2). Therefore, by 
lemma 2, Starobinsky inflation does not represent a light-like geodesically 
complete cosmological model as has been frequently stated in the literature, 
including in ref. [1]. To prevent a further misinterpretation let me reformu- 
late as follows: Inspite of the fact that the Starobinsky model is regular (in 
the sense that a(t) > for arbitrary values of synchronized time t), every 
past-directed light-like geodesic terminates in a curvature singularity (i.e., 
\R\ — > oo) at a finite value of its afnne parameter. Therefore, the model is 
not only geodesically incomplete in the coordinates chosen, moreover, it also 
fails to be a subspace of a complete one. 

Let me add two remarks: 1.: Eq. (1) with a(t) = exp(Ht), H being a positive 

constant, is the inflationary de Sitter space-time. According to lemma 2, it is 
also incomplete. However, contrary to the Starobinsky model, it is a subspace 
of a complete space-time. 2.: This erratum has no further consequences for 
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the Starobinsky model. 

[1] H.-J. Schmidt, Phys. Rev. D 49, 6354 (1994). 



38 



